Picture for Markus Wulfmeier

Markus Wulfmeier

Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

What Matters for Simulation to Online Reinforcement Learning on Real Robots

Add code
Feb 23, 2026
Viaarxiv icon

Improving cosmological reach of a gravitational wave observatory using Deep Loop Shaping

Add code
Sep 17, 2025
Viaarxiv icon

Value from Observations: Towards Large-Scale Imitation Learning via Self-Improvement

Add code
Jul 09, 2025
Viaarxiv icon

Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs

Add code
Jun 25, 2025
Figure 1 for Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs
Figure 2 for Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs
Figure 3 for Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs
Figure 4 for Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs
Viaarxiv icon

The AI Imperative: Scaling High-Quality Peer Review in Machine Learning

Add code
Jun 09, 2025
Viaarxiv icon

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Add code
Apr 22, 2025
Viaarxiv icon

Imitating Language via Scalable Inverse Reinforcement Learning

Add code
Sep 02, 2024
Figure 1 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 2 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 3 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 4 for Imitating Language via Scalable Inverse Reinforcement Learning
Viaarxiv icon

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning

Add code
May 03, 2024
Figure 1 for Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning
Figure 2 for Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning
Figure 3 for Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning
Figure 4 for Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning
Viaarxiv icon

Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution

Add code
Apr 05, 2024
Figure 1 for Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution
Figure 2 for Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution
Figure 3 for Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution
Figure 4 for Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution
Viaarxiv icon